Search CORE

51 research outputs found

An exploration into cognitive bias in ontologies

Author: Keet CM
Publication venue
Publication date: 01/01/2021
Field of study

Ontologies and similar artefacts are used in a myriad of ontology-driven information systems and increasingly also linked to data analytics. Algorithmic bias in data analytics is a well-known notion, but what does bias mean in the context of ontologies that provide a structuring mechanism for, e.g., an algorithm’s or query’s input? What are the sources of bias there, and cognitive bias in particular, and howdo they manifest in ontologies? We examined and enumerated eight broad sources that can cause bias that may affect an ontology’s content. They are illustrated with examples from extant ontologies and samples from the literature. We then assessed three concurrently developed COVID-19 ontologies on modelling bias and detected different subsets of types of bias in each one, to a greater or lesser extent.This first characterisation aims contribute to a sensitisation of bias in ontologies primarily regarding representation of the knowledge

UCT Computer Science Research Document Archive

Natural Language Generation Requirements for Social Robots in Sub-Saharan Africa

Author: Keet CM
Publication venue: IST-Africa Institute and IIMC Ireland
Publication date: 01/01/2021
Field of study

Robots are deployed in Africa mainly in manufacturing, yet they may assist in society as future oriented technologies as well. They may ameliorate, e.g., service delivery issues and skills shortages. In this discussion paper, several uses and use cases relevant to Sub-Saharan Africa are described and requirements identified. We zoom in on human-robot interaction in Niger-Congo B (‘bantu’) languages. Use cases for healthcare and education elucidate specific requirements for the natural language generation component of robots in society. In contrast to typical generation systems, it demands i) combining data-to-text and knowledge-to-text in one system, ii) generating different types of sentences so as to switch between written and spoken language, and iii) processing non-trivial numbers

UCT Computer Science Research Document Archive

Towards a Framework for Meaning Negotiation and Conflict Resolution in Ontology Authoring

Author: Gruetter R
Keet CM
Publication venue
Publication date: 01/01/2020
Field of study

Ontology authoring involves making choices about what subject domain knowledge to include. This may concern sorting out ontological differences as well as making choices of conflicting axioms due to limitations in the logic. Examples are different foundational ontologies in ontology matching and OWL 2 DL’s transitive object property versus qualified cardinality constraints. Such conflicts have to be resolved. However, there is currently only isolated and fragmented guidance for doing so, which therefore results in ad hoc decision-making. This work aims to ad-dress this by working towards a framework dealing with the various types of modeling conflicts through meaning negotiation and conflict resolution in a systematic way. The approach was evaluated with an actual case of domain knowledge usage in the context of epizootic disease outbreak

UCT Computer Science Research Document Archive

ToCT: A task ontology to manage complex templates

Author: Keet CM
Mahlaza Z
Publication venue
Publication date: 01/01/2021
Field of study

Natural language interfaces are a well-known approach to grant non-experts access to semantic web technologies. A number of such systems use simple templates to achieve that for English and more elab-orate solutions for other languages. They keep being designed from scratch in an ad hoc manner, since there is no shared conceptualisation of simple templates and there is no model that is formalised using a Semantic Web language to apply the techniques to itself. We aim to address this by proposing a general-purpose solution in the form of a novel model for templates, formalised as a task ontology in OWL,calledToCT. We used it to develop an ontology-driven text generator for isiZulu, a morphologically-rich language, to test its capabilities. The generator verbalises the TBox of an ontology as validationq uestions. This evaluation showed that the task ontology is sufficiently expressive for the template design, which was subsequently verified with user evaluations who judged the texts positivel

UCT Computer Science Research Document Archive

Assessing and Enhancing Bottom-up CNL Design for Competency Questions for Ontologies

Author: Antia MJ
Keet CM
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2021
Field of study

Competency questions (CQs) are used in ontology development to demarcate the scope, provide insights into their content,and verification. Their use has been impeded by problems with authoring good CQs. This may be assisted by a con-trolled natural language (CNL), but its development is time-consuming when carried out manually. A recent study on data-driven CNL design to learn templates from a set of CQs, resulting in CLaRO,had somewhat better coverage and some noise due to grammar errors in the source CQs. In this paper, we aim to investigate such a bottom-up approach to CNL development for CQs regarding the effects of 1) improving the quality of the source data 2) whether more CQs from other do-mains induce more templates and 3) if the structure of knowledge in subject do-mains has a role to play in the matching of patterns to templates; therewith might indicate that possibly a structure of knowledge in a subject domain may continue to affect bottom-up CNL creation.The CQ cleaning increased the number of templates from 93 to 120 main templates and an additional 12 variants. The new CQ dataset of 92 CQs generated 27new templates and 7 more variants. Thus,increasing the domain coverage had the most effect on the CNL. The CLaRO v2with all generated templates has 147 templates and 59 variants thereof and showed94.1% coverage

UCT Computer Science Research Document Archive

Generating Answerable Questions from Ontologies for Educational Exercises

Author: Keet CM
Raboanary T
Wang S
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 01/01/2021
Field of study

Proposals for automating the creation of teaching materials across the sciences and humanities include question generation from ontologies. Those efforts have focused on multiple-choice questions, whereas learners also need to be exposed to other types of questions, such as yes/no and short answer questions. Initial results showed it is possible to create ontology-based questions. It is unknown how that can be done automatically and whether it would work beyond that use case in biology. We investigated this for ten types of educationally useful questions with additional sentence formulation variants. Each type of questions has a set of template specifications, axiom prerequisites on the ontology, and an algorithm to generate the questions from the ontology. Three approaches were designed: template variables using foundational ontology categories, using main classes from the domain ontology, and sentences mostly driven by natural language generation techniques. The user evaluation showed that the second approach resulted in slightly better quality questions than the first, and the linguistic-driven templates far outperformed both on syntactic and semantic adequacy of the questions

UCT Computer Science Research Document Archive

An evaluation of template and ML-based generation of user-readable text from a knowledge graph

Author: Dunn J
Keet CM
Mahlaza Z
Poulter M
Publication venue
Publication date: 01/01/2021
Field of study

Typical user-friendly renderings of knowledge graphs are visualisations and natural language text. Within the latter HCI solution approach, data-driven natural language generation systems receive increased attention, but they are often outperformed by template-based systems due to su ering from errors such as content dropping, hallucination, or repetition. It is unknown which of those errors are associated signi cantly with low quality judgements by humans who the text is aimed for, which hampers addressing errors based on their impact on improving human evaluations. We assessed their possible association with an experiment availing of expert and crowdsourced evaluations of human authored text, template generated text, and sequence-to-sequence model generated text. The results showed that there was no significant association between human authored texts with errors and the low human judgements of naturalness and quality. There was also no significant association between machine learning generated texts with dropped or hallucinated slots and the low human judgements of naturalness and quality. Thus, both approaches appear to be viable options for designing a natural language interface for knowledge graph

arXiv.org e-Print Archive

UCT Computer Science Research Document Archive

Dimensions Affecting Representation Styles in Ontologies

Author: A Gangemi
CM Keet
CM Keet
E Mikroyannidi
J Pinggera
M Grüninger
PR Fillottrani
R Hoehndorf
R Stevens
S Walk
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 01/01/2019
Field of study

There are different ways to formalise roughly the same knowledge, which negatively affects ontology reuse and alignment and other tasks such as formalising competency questions automatically. We aim to shed light on, and make more precise, the intuitive notion of such `representation styles' through characterising their inherent features and the dimensions by which a style may differ. This has led to a total of 28 different traits that are partitioned over 10 dimensions. The operationalisability was assessed through an evaluation of 30 ontologies on those dimensions and applicable values. It showed that it is feasible to use the dimensions and values and resulting in three easily recognisable types of ontologies. Most ontologies had clearly one or the other trait, whereas some were inherently mixed due to inclusion of different and conflicting design decisions

Crossref

UCT Computer Science Research Document Archive

A classification of grammar-infused templates for ontology and model verbalisation

Author: A Gatt
B Davis
B Davis
CM Keet
CM Keet
CM Keet
H Safwat
I Androutsopoulos
J Byamugisha
J Byamugisha
JA Bateman
K Deemter van
N Bouayad-Agha
R Stevens
S Rajbhandari
SH Lim
T Kuhn
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 01/01/2019
Field of study

Involving domain-experts in the development, maintenance, and use of knowledge organisation systems can be made easier through the introduction of easy-to-use interfaces that are based on natural language. Well resourced languages make use of natural language generation techniques to provide such interfaces. In particular, they often make use of templates combined with computational grammar rules to generate grammatically complex text. However, there is no model of pairing templates and computational grammar rules to ensure suitability for less-resourced languages. These languages require a modular design that ensures grammar detachability so as to allow grammar re-use across domains and applications. In this paper, we present a model and classification scheme for grammar-infused templates suited for less-resourced languages and classify existing systems that make use of them. We have found that of the 15 systems that pair templates and grammar rules, and their 11 distinct template types, 13 have support for detachable grammars

Crossref

UCT Computer Science Research Document Archive

TDDonto2: A Test-Driven Development Plugin for arbitrary TBox and ABox axioms

Author: CM Keet
K Beck
M Vigo
N Matentzoglu
R Iqbal
S Ferré
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 01/01/2017
Field of study

Ontology authoring is a complex task where modellers rely heavily on the automated reasoner for verification of changes, using effectively a time-consuming test-last approach. Test-first with Test-Driven Development aims to speed up such processes, but tools to date covered only a subset of possible OWL 2 DL axioms and provide limited feedback. We have addressed these issues with a model for TDD testing to give more feedback to the modeller and seven new, generic, TDD algorithms that also cover OWL 2 DL class expressions on the left-hand side of inclusions and ABox assertions by availing of several reasoner methods. The model and algorithms have been implemented as a Prot\'eg\'e plugin, TDDonto2

Crossref

UCT Computer Science Research Document Archive